Crossover from Scale-Free to Spatial Networks 
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In many networks such as transportation or communication networks, distance is certainly a 
relevant parameter. In addition, real-world examples suggest that when long-range links are existing, 
they usually connect to hubs-the well connected nodes. We analyze a simple model which combine 
both these ingredients — preferential attachment and distance selection characterized by a typical 
finite 'interaction range'. We study the crossover from the scale-free to the 'spatial' network as 
the interaction range decreases and we propose scaling forms for different quantities describing the 
network. In particular, when the distance effect is important (i) the connectivity distribution has a 
cut-off depending on the node density, (ii) the clustering coefficient is very high, and (iii) we observe 
a positive maximum in the degree correlation (assortativity) which numerical value is in agreement 
with empirical measurements. Finally, we show that if the number of nodes is fixed, the optimal 
network which minimizes both the total length and the diameter lies in between the scale-free and 
spatial networks. This phenomenon could play an important role in the formation of networks and 
could be an explanation for the high clustering and the positive assortativity which are non trivial 
features observed in many real- world examples. 

PACS numbers: 89.75.-k, 89.75.Hc, 05.40 -a, 89.75.Fb, 87.23. Ge 
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Even if some networks are defined without any ref- 
erence to an embedding space, it is not the case for 
most real- world networks. Most people have their friends 
and relatives in their neighborhood, transportation net- 
works depend obviously on distance, many communica- 
tion networks devices have short radio range [1-3], and 
the spread of contagious diseases is not uniform across 
territories. A particularly important example of such a 
spatial network is the Internet which is defined as a set 
or routers linked by physical cables with different lengths 
and latency times [4]. From these examples, it appears 
important to define a realistic model in which nodes and 
links are embedded in a physical space which induces a 
distance between nodes, and we will designate these net- 
works as 'spatial'. More generally, the distance could 
be another parameter such as a social distance measured 
by salary, socio-professional category differences, or any 
quantity which measures the cost associated with the link 
formation. If the cost of a long-range link is high, most of 
the connections starting from a given node will link to the 
nearest neighbors. When a long-range link is existing, it 
will usually connect to a well-connected node — that is, a 
hub. This is for instance the case for airlines: Short con- 
nections go to small airports while long connections point 
preferably to big airport (ze. well connected nodes). This 
propensity to link to an already well connected node has 
been coined preferential attachment [5,6]: The probabil- 
ity to link to a node is proportional to the connectivity 
of it. It is widely accepted that preferential attachment 
is the probable explanation for the power-law distribu- 
tion seen in many networks [6]. However, even if this 
process generates reasonably realistic networks, it misses 
the important element of the cost of links. We will study 



a simple model which incorporates such a cost with a 
typical scale rc- 

Several models including distance were previously pro- 
posed [7-14] but the case of preferential attachment with 
a finite scale rc was not considered before and more gen- 
erally, the study of the interplay between preferential 
attachment and distance effects is still lacking. In this 
Letter, we study this interplay and we demonstrate the 
existence of a crossover from the scale-free network to 
the spatial network when the interaction range decreases. 
In particular, we propose scaling forms for the different 
quantities which characterize the network. 

The N nodes of the network are supposed to be in a 
rf-dimensional space of linear size L and we will assume 
that they are distributed randomly in space with uni- 
form density p. One could use other distributions: For 
instance in cities the density decreases exponentially from 
the center [15]. The case of randomly distributed points 
is interesting since on average it preserves natural sym- 
metries such as translational and rotational invariance in 
contrast with lattices. For the sake of simplicity, we will 
choose for our numerical simulations the two-dimensional 
plane and the Euclidean distance. Once the nodes are 
distributed in this space, we have to construct the links 
and we use the following algorithm: (1) Select at random 
a subset of uq initial active nodes. (2) Take an inactive 
node i at random an connect it with an active node j 
with probability (up to a normalization factor) 
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(1) 



where kj is the connectivity of node j, dij is the dis- 
tance between nodes i and j, and Z and A are given 
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functions. Finally (3), make the node i active and go 
back to (2) until all nodes are active. For each node, 
we repeat m times the steps (2 — 3) so that the average 
connectivity will be (fc) = 2m (numerically wc choose 
m = 3). There are essentially three different interesting 
cases: (i) Preferential attachment When Z{k) = k + 1, 
and A = const., wc recover the usual preferential at- 
tachment problem [6]: The connectivity distribution is 
a power law with exponent 7 = 3, the shortest path £ 
is growing with N as £ ^ logiV, the assortativity is de- 
creasing as \og^ N/N [16], and the clustering coefScient 
is decreasing as 1/N [17]. (ii) Distance selection: In this 
case Z = const.: There is a distance effect only [2,10-12]. 
Jost and Joy [12] studied different functions A(d), Dall 
and Christensen [10] studied graphs in which each ver- 
tex is randomly located and connected with its adjacent 
points, while in [11], the authors study a (small- world) 
network constructed from re-wiring links with a proba- 
bility that decreases as a power-law with distance, (iii) 
Preferential attachment and distance selection: It is the 
case where Z{k) = k + 1 and A is a increasing function 
of the distance. In most cases — such as transportation 
networks or social interactions — the range of interaction 
is limited, which is explained by the fact that there is a 
cost associated to long range links. Up to our knowledge, 
studies done so far in this case were concerned with the 
case where A decreases as a power-law [13,14,7]. The 
main result is then the existence of different regimes ac- 
cording to the value of the exponent describing the spa- 
tial decay of A. In contrast, in this Letter we will study 
the finite-range case for which the function A is negligible 
above a finite scale rc, a prototype being the exponential 
function 

A{d) = e'^/^- (2) 

Even if there is some controversy on the spatial decay of 
the linking probability for Internet cables [4,7], it seems 
that the range is relatively short and that when a new 
server (or router) adds to the network, it will connect 
preferably to the nearest node(s). One of the most impor- 
tant model of Internet topology relying on this argument 
and using Equ. (2) is the Waxman topology generator 
[18] and the model considered here thus appears has the 
natural generalization of the Waxman case. 

When the interaction range is of the order of the sys- 
tem size (or larger), we expect the distance to be irrele- 
vant and the obtained network will be scale-free. In con- 
trast, when the interaction range is small compared to the 
system size, we expect new properties and we would like 
to understand the crossover between these two regimes as 
well as the scaling for the different quantities describing 
the network. 

Probability distribution. We did simulations for this 
model and as expected, when r] = rc/L is larger than 
one, the distance selection is in-operant and we recover 



the usual scale-free network obtained by preferential at- 
tachment. In particular, the probability distribution is 
a power law with exponent 7 = 3 independently of the 
actual value of rj (not shown). In the opposite situation 
ry <C 1, the distance is relevant and we expect a cut-off in 
the (one minus the) cumulative connectivity distribution 

F(fc)~fc-^+V(^) (3) 

where kc is the cut-off at large connectivity: /(a; » 1) ~ 

(sec figure la). In this problem, we have two dimension- 
less parameters: The number of nodes and rj. Our 
guess — a posteriori verified — is that the control parame- 
ter is the average number of points present in a sphere of 
radius rc (and of volume Vd{rc)) as given by n = pVd{rc). 
We thus propose the following scaling ansatz for the cut- 
off valid only for 77 ^ 1 

fce ~ (4) 

In order to test these assumptions, we first plot the cu- 
mulative distribution for different values oi N, rc and 
L for d = 2 (see Fig. lb). We then use our scaling 
ansatze (3), (4) and we indeed obtain a good data col- 
lapse (Fig. Ic) with f3 ~ 0.13. According to these results, 
the distance effect limits the choice of available connec- 
tions thereby limiting the connectivity distribution for 
large values. More generally, the distance effect induces 
correlations which are known to affect the connectivity 
distribution [19]. 

Clustering coefficient. The clustering coefficient C is 
defined as the average over all nodes of the percentage of 
existing links between neighbors [20]. For the scale-free 
network C is small and decreases as 1/N. In contrast, 
when the distance effect is important we expect a higher 
cluster coeflBcient. Indeed, if two given nodes i and j are 
connected it means that the distance dij is less or of the 
order of rc. In the process of adding links, if a new node 
k links to i, it means also that dki < rc. This implies 
that j and k belongs to the disk of cc;nter i and of radius 
rc. The probability that k and j will link will depend on 
the distance djk- When we can neglect the preferential 
attachment and if we suppose that a given node is con- 
nected to all its neighbors, the probability that k is also 
linked to node j is given in terms of the area of the in- 
tersection of the the two spheres of radius rc centered on 

1 and j respectively. This is a simple calculation done in 
[10] and predicts that for = 2 the clustering coefficient 
is Co = 1 — 3-\/3/47r ~ 0.59. We expect to recover this 
limit for 77 ^ and for an average connectivity (k) — 6 
which is a well-known result in random geometry [21]. 
If T] is not too small, the preferential attachment is im- 
portant and induces some dependence of the clustering 
coeflBcient on N. In addition, we expect that C will be 
lower than Co since in our model the links don't connect 
necessarily to the nearest neighbors. We first compute 
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C versus the number of nodes TV for different values of 
r] (not shown) and the same data versus n (Fig. 2a) fall 
nicely on the same curve which is a decreasing function. 
In order to understand this variation, we consider one 
given node iq. When n is above 1 and increasing, the 
number of neighbors of the node io will increase and the 
probability that two of them will be linked is thus de- 
creasing which explains the monotonic decay of C{n). 

Assortativity. We also compute the assortativity A of 
the network which measures the correlation between the 
degree of the nodes. In this case, one way to measure this 
correlation is the Pearson correlation coefficient of the de- 
grees at either ends of an edge [23] . This quantity varies 
from —1 (disassortative network) to 1 (perfectly assorta- 
tive network) and is decreasing towards zero as log^ N/N 
for the scale- free network. In the case 77 <C 1, we compute 
the assortativity coefficient versus and we plot the re- 
sults versus n (Fig. 2b). The data fall onto a single curve 
and exhibit a maximum for n ~ 0.1 where the network is 
also very clustered. In addition, this maximum is positive 
which indicates that the hubs are connected and not dis- 
persed in the network. For very small n, the connectivity 
does not fluctuate and A~ while for large n preferential 
attachment dominates and A is also small. This could be 
a possible explanation for the maximum observed in the 
intermediate regime where distance selection and prefer- 
ential attachment coexist. It is interesting to note that 
the value of the maximum (Amax ~ 0.2) is in agreement 
with the empirical measurements for different networks 
[24] which give A e [0.20,0.36]. The cost of Unks thus 
induces positive correlations and could provide a simple 
explanation to the positive assortativity observed in so- 
cial networks (actors, collaboration, or co-authorship). 
This fact is also very important from the point of view of 
the resilience of the network since it is very sensitive to 
the degree correlation [24]. In particular, for the scale- 
free network, deleting hubs when A > is not as efficient 
when A ~ or A < [25] . This means that for this type 
of networks, one has to adapt the best attack strategy to 
the value of the density. 

Diameter. An important characterization of a network 
is its diameter i [22] : It is the shortest path between two 
nodes averaged over all pairs of nodes and counts the 
number of hops between two points. 

We propose the following scaling ansatz which describe 
the crossover from a spatial to a scale-free network 



e{N,T]) = [iV*(r?)]"$ 



TV 



N*{r]) 



(5) 



with $(a; <C 1) ~ and $(x > 1) ~ logo;. The typical 
size A^* is depending on t] and its behavior is a priori com- 
plex. However, we can find its behavior in two extreme 
cases. For 77 1, space is irrelevant and 



where Nq is a finite constant. When 77 ^ 1, the exis- 
tence of long-range links will determine the behavior of 
£. If we denote by a = l/p^/'* the typical inter- node dis- 
tance, we have to distinguish two regimes: If Tc ^ a then 
long-range links cannot exist and therefore N <C N*. If 
Vc ^ a, long-range links can exist and we are in a small- 
world regime N ^ A'^*. This argument implies that A''* 
is such that Tc ^ a which in turn implies 



Ar*(,7«l)^l 



(7) 



In Fig. 3a, we use the ansatz Equ. (5) together with the 
results Equ. (6), (7). The data are collapsing onto a sin- 
gle curve showing the validity of our scaling ansatz. This 
data collapse is obtained for a ~ 0.31 and A^o — 180 (for 
d = 2). The scale- free network is a 'small' world: the di- 
ameter is growing with the number of points as ^ ~ log TV. 
In the opposite case of the spatial network with a small 
interaction range, the network is much larger: To go 
from a point A to a point B, we essentially have to pass 
through all points in between and the behavior of this 
network is much that of a lattice with £ TV", although 
the diameter is here smaller probably due to the exis- 
tence of some rare longer links (in the case of a lattice 
a = l/d). 

Total cost and optimization. Finally, we compute the 
average per node of the distances of all links 



links 



(8) 



TV*(?7> 1) ~ A^o 



(6) 



This quantity is a simple measure of the total cost of the 
network. For 77 <C 1, preferential attachment is irrelevant 
and the links distance is essentially distributed accord- 
ing to p(r) ~ exp(— r/ro) which leads to 5 ^ Lrj. In 
the opposite situation r] ^ 1, only preferential attach- 
ment is important and the links distance is distributed 
according to p{r) ~ (i7r''/^r''~^/r(fi/2-|-l) which gives for 
d = 2 S Ln/d (figure 3b). When 77 is increasing the 
total cost is thus increasing while TV* is decreasing. This 
implies that for fixed TV, the network which simultane- 
ously minimizes the total cost and the average diameter 
(ie. with small TV*) is non-trivial and lies in between 
the 'pure' scale-free network (ie. without distance effect) 
and the 'pure' spatial network (ie. with no preferential 
attachment). 

In summary, our results demonstrate the general im- 
portance of a cost in the formation of networks which 
induces a dependence of all quantities on the node den- 
sity. In addition, the optimal network which minimizes 
both the total length and the diameter (at fixed number 
of nodes) will lie in between the scale-free and the spatial 
network. This result could thus provide a simple expla- 
nation to the large clustering coefficient and the positive 
correlations between node degrees as it is observed in 
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some cases where creating a long link is costly such as 
social or transportation networks for example. 
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FIG. 1. (a) Cumulative distribution function for different 
values of N going from N = 10"^ to 10^ and for 77 going from 
1/400 to 1/200. (b) Data collapse for the same data of figure 
(b) with kc ~ n'' with f3 ~ 0.13. 
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FIG. 2. Clustering coefficient versus versus the mean num- 
ber n — pTir^ of points in the disk of radius rc (plotted in 
Log-Lin). The dashed line corresponds to the theoretical 
value Co computed when a vertex connects to its adjacent 
neighbors without preferential attachment [9]. (b) Assorta- 
tivity versus n (in Log-Lin) . This plot shows clearly a positive 
maximum for n ~ 0.1. Around this maximum, the network 
will be resilient even to a targeted attack against hubs. 
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FIG. 3. (a) Data collapse (in Log-Log) for ^(A, r]) using our 
scaling ansatz Equ. (5) together with Equs. (6), (7). The data 
collapse is obtained with 14 curves for r\ going from 1/500 to 
100 and for A up to 10^. The first part of the scaling func- 
tion exhibits a power law behavior with exponent q ~ 0.32, 
followed by a logarithmic behavior for N/N* ^ 1. (b) Total 
length of the network per node (and per distance) S/ L versus 
rj going from 1/100 to 10. This cost is first linearly increas- 
ing with ri and as the preferential attachment becomes more 
important it converges to 7r/6 (long-dashed line). 
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